A Class Of Separate-Type Estimators For Population 
Mean In Stratified Sampling Using Known Parameters 
Under Non-Response 


‘Manoj K. Chaudhary, Rajesh Singh , and Mukesh Kumar, “Rakesh K. Shukla 
and “Florentin Smarandache 
1234Department of Statistics, Banaras Hindu University 
Varanasi-221005, INDIA 


°Chair of Department of Mathematics, University of New Mexico, 
Gallup, USA 


‘ritamanoj15@gmail.com,*rsinghstat@gmail.com,’Mukesh.stat@gamil.com, 
‘timgyanu@gamil.com *smarand@unm.edu 


ABSTRACT 


The objective of the present paper is to propose a family of separate-type 
estimators of population mean in stratified random sampling in presence of non- 
response based on the family of estimators proposed by Khoshnevisan et al. 
(2007). Under simple random sampling without replacement (SRSWOR) the 
expressions of bias and mean square error (MSE) up to the first order of 
approximation are derived. The comparative study of the family with respect to 
usual estimator has been discussed. The expressions for optimum sample sizes 
of the strata in respect to cost of the survey have also been derived. An 


empirical study is carried out to shoe the properties of the estimators. 
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1. INTRODUCTION 

Upadhyaya and Singh (1999) have suggested the class of estimators in 
simple random sampling using some known population parameter(s) of an 
auxiliary variable. These estimators have been extended by Kadilar and Cingi 
(2003) for stratified random sampling. In an attempt to improve the estimators, 
Kadilar and Cingi (2005), Shabbir and Gupta (2005, 2006) and Singh and 
Vishwakarma (2008) have suggested new ratio estimators in stratified random 
sampling. Using power transformation Singh et al. (2008) have suggested a class 
of estimators adapting the estimators developed by Kadilar and Cingi (2003). 
Koyuncu and Kadilar (2008, 2009) have proposed a family of combined-type 
estimators in stratified random sampling based on the family of estimators 
proposed by Khoshnevisan et al. (2007). Singh et al. (2008) suggested some 
exponential ratio type estimators in stratified random sampling. Recently 
Koyuncu and Kadilar (2010) have suggested a family of estimators in stratified 
random sampling following Diana (1993) and Kadilar and Cingi (2003). 


Let YandXbe the study and auxiliary variables respectively, with 


respective population means YandX. Khoshnevisan et al. (2007) have 
proposed a family of estimators for population mean using known values of some 


population parameters in simple random sampling (SRS) given by 
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(1.1) 


where a+0, b are either real numbers or functions of known parameters of the 
auxiliary variable X . 

In this paper, we have proposed a family of separate-type estimators of 
population mean in stratified random sampling in presence of non-response on 
study variable adapting the above family of estimators. The properties of the 
proposed family of estimators in comparison with usual estimators have been 
discussed. The expressions for optimum sample sizes of the strata with respect 
to cost of the survey have been obtained. 


2. SAMPLING STRATEGY AND ESTIMATION PROCEDURES 
Let us consider a finite population of size, N , is divided into k strata. Let 


N,be the size of i” stratum (i = 1,2,...,k) and a sample of size n, is drawn from the 


k 
i” stratum using SRSWOR scheme such that n, =n. It is assumed that the 


i=l 
non-response is detected on study variable Y only and auxiliary variable X is free 


from non-response. 
—* = 
Let y; and x; are the unbiased estimators of population means Y; and 
Xi respectively, for the i” stratum, given as 


RY +n,y, 
y; = Na Yni Nin Y yin (2.1) 





n; 
where y,,and y,., are the means based on n,,response units and u, non- 
response units of sub sample selected from n,, non-response units respectively. 
x; be the sample mean based on n, units. 


Therefore an unbiased estimator of population mean Y is given by 
hy ae oss 
Ya =D Pi; (2.2) 
i=l 
and variance of the estimator is expressed as 


—* a at eee | k (k,-1) 
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where S^ and Sf, are respectively the mean-squares of entire group and non- 


response group of study variable in the population for the i” stratum. k, eens 
Uiz 


m , f N, 
p= ai and W,, =non-response rate of the i” stratum in the population= ae 
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2.1 SUGGESTED FAMILY OF ESTIMATORS 
Adapting the idea of Khoshnevisan et al. (2007), we propose a family of 


separate-type estimators of population meanY , given by 
T; =) pT; (2.4) 


where T) =y | —= OAE (2.5) 
&æ(axi+b)+(1-&)(aX i +b) 





Obviously, T, is biased forY. Therefore, bias and MSE of T, can be 
obtained on using large sample approximations. Let 
y, =Y;(I+e,) ; xi =X;(l+e,) 


such that E(e,)=E(e,)=0 and 


E(e2)= vb) J. =f, C? E U 
Y; n, Yi 
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Covly, , Xi 
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N; =n; =n, 
where = , C= 
f= Wip 


Si Se , 

Æ, Cy, =—t, Sy, be the mean-square of entire 
i Xi 

group of auxiliary variable in the population for the i” stratum and p,is the 

correlation coefficient between Y and X in the i” stratum. 


Expressing the estimator T, in terms of e, ande, , we get 


T, = Yay: Yi(l+e, [l+aa,e,|° (2.6) 
where joecass on 
aX, +b 


Suppose |a@/,e,|< 1 so that [1+ a4,e,]® is expandable. Expanding the right 


hand side of the equation (2.6) up to the first order of approximation, we obtain 





= ker 
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i=l 
Taking expectations of both sides of (2.7), we get the bias of 7,up to the 
first order of approximation, as 


Yip fy see set) g a Ci, 22, 90.0,C. | (2.8) 


Sf 
Squaring both side of equation (2.7) and taking expectations on both sides of this 


equation, we get the MSE(7, ) to the first order of approximation as given below: 


(k, = 1) 





MSE(T oy s7 Yi te +a? oe Ce - 20d, ¢p,CyC x; )+ W, 2S v2 (2.9) 


i=l 


2.2 SOME SPECIAL CASES 
Case 1: If we puta =1, a=1,b=0 and g =1 in equation (2.4), we get 


k a X; 
T; Dooa (2.10) 


which is separate ratio estimator of population mean Y under non-response. 


Case 2: If ~=1, a=1,b=0 and g =-1, the equation (2.4) gives 





k =N 
Ts =), py > (2.11) 


which is separate product estimator of population mean Y under non-response. 
Case 3: If we takea =0, a=0,b=0 and g =0, the equation (2.4) provides 


k 
Ts = DOP: (2.12) 


which is the usual estimator of population mean Y under non-response. 
Similarly, we can obtain the various existing estimators of the family 
under non-response on different choices of@, a, b and g [ See Khoshnevisan et 


al. (2007)]. 


2.3 OPTIMUM CHOICE OF @ 
In order to obtain the optimum œ we minimize MSE(T,) with respect to 


a. Differentiating MSE(T,) with respect to œ and equating the derivative to 


zero, we get the normal equation 


OMSE\|T k = 
OMSEIL) L S p? fF? pad}, - 204, ¢p,C,Cul=0 (2.13) 
a i=l 
k= 
Èp, SiYi p/CyCy, 
> Qp == (2.14) 


k = 
AgS pef YC 

i=l 
Thus the equation (2.14) provides the value of œ at which MSE(T, )would 


be minimum. 


2.4 OPTIMUM n, WITH RESPECT TO COST OF THE SURVEY 
Let C, be the cost per unit of selecting n, units, C, be the cost per unit in 
enumerating n,,units and C,, be the cost per unit of enumerating u, units. Then 
the total cost for the i” stratum is given by 
C; = 6 nt Cana FC alts V i=1,2,...,k (2.15) 


Now, we consider the average cost per stratum 


E(C,)= nf +C,W,, +C,, a (2.16) 
Thus the total cost over all the strata is given by 
Cae SEC.) 
=I 
k w 
= Sn) Cu +C Wa +C, a (2.17) 


Let us consider the function 
¢ = MSE(T, )+ uC, (2.18) 


where uw is Lagrangian multiplier. Differentiating the equation (2.18) with respect 
to n; and k, respectively and equating the derivatives to zero, we get the 


following normal equations: 


2 j; 
we Pic; +a’ ae Ci - 20, 8P:C xC xi )+ (k; —1)W,S yi + 
W. 
dc, + CW, + Ci, a) =0 (2.19) 
o P: WaS i2 W 
= Fi C,, —2- = 0 2.20 
Ok, n, i~i2 k? ( ) 


From the equations (2.19) and (2.20), we have 
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and Jy = E 2Sr (2.22) 


ie 


Putting the value of the ʻu from equation (2.22) into the equation (2.21), 


we get 


JC, B, 
i(opt) = ee (2.23) 
Syn; 


where A, = ./C,, +C,,W,, 





and B, = We E. + aA g Chi = 200, gPCy,C xi )- W352, 


On substituting k») into equation (2.21), n, can be expressed as 





B? aa V Cn BWS yn 


A, 
n, = : (2.24) 
+ Ci AWS yin 





The Juin terms of total cost C, can be obtained by putting the values of 











kion and n, from equations (2.23) and (2.24) respectively into equation (2.17) as 
i(opt) q 

1 k 
Ju = à Pi (A;B, h Ca W,aSna) (2.25) 

0 i=l 

Thus the n, can be expressed in terms of the total cost C, as 

paja l CoB WS) 

Ni(opt) z k : = (2.26) 
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The optimum values of n, and k, can be obtained by the expressions 


(2.26) and (2.23) respectively. 


3. EMPIRICAL STUDY 
In this section, we use the data set in Koyuncu and Kadilar (2009). The 
data concerning the number of teachers as study variable and the number of 
students as auxiliary variable in both primary and secondary school for 923 
districts at 6 regions (as 1: Marmara, 2: Agean, 3: Mediterranean, 4: Central 
Anatolia, 5: Black Sea, 6: East and Southeast Anatolia) in Turkey in 2007 
(Source: Ministry of Education Republic of Turkey). 


Table 1: Stratum means, Mean Squares and Correlation 


Coefficients 


























Stratu = — 
N, Mi Yi Xi Sy; S yi S xvi Pi Sn 
m No. 
12 | 3 | 703.7 | 20804.5 30486.75 | 25237153.5 | .93 
1 883.835 440 
7 1 4 9 1 2 6 
11 2 | 413.0 15180.76 .99 
2 9211.79 | 644.922 9747942.85 200 
7 1 0 9 6 
A 10 | 2 | 573.1 | 14309.3 | 1033.46 | 27549.69 | 28294397.0 | .99 ae 
3 9 7 0 7 7 4 4 
17 | 3 | 424.6 18218.93 | 14523885.5 | .98 
4 9478.85 | 810.585 405 
0 8 6 1 3 3 
20 | 2 | 527.0 .98 
5 5569.95 | 403.654 | 8497.776 | 3393591.75 180 
5 2 3 9 
20 | 3 | 393.8 | 12997.5 23094.14 | 15864573.9 | .96 
6 711.723 300 
1 9 4 9 1 7 5 
































Table 2: Percent Relative Efficiency (P.R.E.) of 7, with respect to Vy at 


Qopt) = 0.9317, a =land b=1 





















































W, k, P.R.E(T, ) 
2.0 1319.17 
2.5 1153.15 
0.1 3.0 1026.92 
3.5 927.72 
2.0 1026.92 
2.5 847.70 
0.2 3.0 726.55 
3.5 639.18 
2.0 847.70 
2.5 679.59 
0.3 3.0 573.20 
3.5 499.81 
2.0 726.55 
2.5 573.20 
0.4 3.0 480.16 











417.69 











4. CONCLUSION 
In this paper, a class of separate-type estimators for estimating the 
population mean in stratified random sampling under non-response has been 
proposed and method of finding the optimum estimator of the family has also 
been discussed. We have derived the expressions for optimum sample sizes in 
respect to cost of the survey. From the Table 2, it is easily observed that the 


optimum estimator of the proposed class T, provides better estimate than usual 


estimator y, under non-response. It is also observed that the relative efficiency 


of T, decreases with increase in the non-response rate W,, andk,. 
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